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Abstract 

Directed graphs can be partitioned in so-called passages. A passage P is a set of edges such that any 
two edges sharing the same initial vertex or sharing the same terminal vertex are both inside P or are 
both outside of P. Passages were first identified in the context of process mining where they are used to 
successfully decompose process discovery and conformance checking problems. In this article, we examine the 
properties of passages. We will show that passages are closed under set operators such as union, intersection 
and difference. Moreover, any passage is composed of so-called minimal passages. These properties can be 
exploited when decomposing graph-based analysis and computation problems. 
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^"» 1. Introduction 

Recently, the notion of passages was introduced in the context of process mining [3J. There it was 

O used to decompose process discovery and conformance checking problems [T|. Any directed graph can be 

partitioned into a collection of non-overlapping passages. Analysis can be done per passage and the results 

t-H can be combined easily, e.g., for conformance checking a process model can be decomposed into process 

J> fragments using passages and traces in the event log fit the overall model if and only if they fit all process 

C^ fragments. 

As shown in this article, passages have various elegant problems. Although the notion of passages is very 
simple, we could not find this graph notion in existing literature on (directed) graphs [31 Ej. Classical graph 
partitioning approaches [3 [8] decompose the vertices of a graph rather than the edges, i.e., the goal there 
is to decompose the graph in smaller components of similar size that have few connecting edges. Some of 
£f\ these notions have been extended to vertex-cut graph partitioning [SJ|5]. However, these existing notions 

are not applicable in our problem setting where components need to behave synchronously and splits and 
joins cannot be partitioned. We use passages to decompose a graph into sets of edges such that all edges 
sharing an initial vertex or terminal vertex are in the same set. To the best of our knowledge, the notion 
of passages has not been studied before. However, we believe that this notion can be applied in various 
domains (other than process mining). Therefore, we elaborate on the foundational properties of passages. 

The remainder is organized as follows. In Section [2j we define the notion of passages, provide alternative 
characterizations, and discuss elementary properties. Section [3] shows that any graph can be partitioned 
into passages and that any passage is composed of so-called minimal passages. Section [4] introduces passage 
graphs visualizing the relations between passages. Graphs may be partitioned in different ways. Therefore, 
Section [5] discusses the quality of passage partitionings. Section [6] concludes this article. 

2. Defining Passages 

Passages are defined on directed graphs, simply referred to as graphs. 

Definition 1 (Graph). A (directed) graph is a pair G = (V,E) composed of a set of vertices V and a set 
of edges E C V X V". 
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Figure 1: Graph G\ with 9 vertices, 12 edges, and 32 passages. 

A passage is a set of edges such that any two edges sharing the same initial vertex (tail) or sharing the 
same terminal vertex (head) are both inside or both outside of the passage. For example, {(a, &), (a,c)} is 
a passage in graph G\ shown in Figure [I] because there are no other edges having a as initial vertex or b or 
c as terminal vertex. 

Definition 2 (Passage). Let G = (V,E) be a graph. P C E is a passage if for any (x,y) £ P and 
{(x,y'),(x' ,y)} C E: {(x,y'), (x',y)} C P. pas(G) is the set of all passages ofG. 

Figure[2]shows 7 of the 32 passages of graph G\ shown in Figure]!] P2 = {(b, e), (6, /), (c, /), (c, d), (d, d), (d, /)} 
is a passage as there are no other edges having b, c, or d as initial vertex or d, e, or / as terminal vertex. 
Figure [2] does not show the two trivial passages: (no edges) and E (all edges). 
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Figure 2: Seven example passages of graph G\ shown in Figure [T] 

Lemma 1 (Trivial Passages). Let G = (V,E) be a graph. The empty passage and the full passage E 
are trivial passages of G. Formally: {0,-E} C pas(G) for any G. 

Some of the passages in Figure [2] are overlapping: Pq — P3 U P4 U P5 and P-i = Pi U P3 U P4. To combine 
passages into new passages and to reason about the properties of passages we define the following notations. 



Definition 3 (Passage Operators). Let G = (V,E) be a graph with P,P 1 ,P 2 C E. Pi U P 2; P\ n P 2 , 

P\ \ P2, Pi = P2, Pi 7^ Pi, P\ C P2, and Pi C P2 are defined as usual. tti(P) = {x \ (x,y) S P} are 
the initial vertices of P, ^(P) = {y | (x,y) e P} are the terminal vertices of P, Pi#P 2 if and only if 
Pi n P 2 = 0, Pi > P 2 if and only if 7T 2 (Pi) n tti(P 2 ) 7^ 0. 



Note that d is both an initial and terminal vertex of P 2 in Figure[2j 711 (P2) = {b, c, d} and ^2(^2) = {d, e, /}. 
P 5 #P 7 because P 5 n P 7 = 0. P 4 > Ps because 7r 2 (P 4 ) n 7Ti(P 5 ) = {h} ^ 0. 

The union, intersection and difference of passages yield passages. For example, P 7 = P\ U P3 U P 4 is a 
passage composed of three smaller passages. P 5 = P 6 \ P 7 and P 6 n P 7 = P 3 U P 4 are passages. 

Lemma 2 (Passages Are Closed under U, n and \). Let G = (V,E) be a graph. If Pi,P 2 € pas(G) 
are two passages, then Pi U P 2 , PiCi P 2 , and Pi \ P 2 are also passages. 

Proof. Let Pi,P 2 € pas(G), (x,y) e P1UP2, md{(x,y'), (x',y)} C P. We need to show that {(x,y'), (x', y)} C 
PUP 2 . If (x,y) eP 1 ,thcn{(x,y'),(x',y)}CP 1 CP 1 UP 2 . If (x,y) € P 2 , then {(x,y'), (x',y)} C P 2 C 
PUP 2 . 

Let Pi,P 2 € pa$(G), (x,y) G PinP 2 , and {(x,?/), (x',y)} C P. We need to show that {(x, ?/)> O^jS/)} £ 
Pi n P 2 . Since (x,y) G P x , {(x, y')> (^ 2/)} Q p i- Since 0>3/) G p 2, {(x,y'),(x , ,y)} C P 2 . Hence, 
{(x, 2 /'),(^,2/)}CPinP 2 . 

Let Pi,P 2 G pas(G), (x.y) G Pi\P 2 , and {(x, y'), (a;', y)} C P. We need to show that {(*,j/), (a^,j/)} C 
P \P 2 . Since (x,y) € Pi, {(x,y'), (z',2/)} C P x . Since (x,y) £ P 2 , {(i,i/'),(x',i/)}nF 2 = 0. Hence, 
{(x,y'),(x',y)}CP 1 \P 2 . a 

A passage is fully characterized by both the set of initial vertices and the set of terminal vertices. Therefore, 
the following properties hold. 

Lemma 3 (Passage Properties). Let G = (V, E) be a graph. For any P\,P 2 £ pas(G): 

• 7n(Pl) = 7Tl(P 2 ) & P = Pi «■ ^ 2 (Pl) = 7T 2 (P 2 ), 

• Pi#P 2 «*■ 7ri(Pi)n7ri(P 2 ) = 0, and 

• Pi#P 2 «■ 7r 2 (Pi)n^2(P 2 )=0- 

PROOF. X = 7Ti(P) implies P = {(x, y) G P | x G X} (definition of passages). Hence, 7Ti(Pi) = 7Ti(P 2 ) =4- 
Pi = P 2 (because a passage P is fully determined by 7i"i(P)). The other direction (-*=) holds trivially. A 
passage P is also fully determined by ir 2 (P). Hence, ^(Pi) = 7r 2 (P 2 ) =► Pi = P2. Again the other 
direction (4=) holds trivially. 

The second property follows from the observation that two passages share an edge if and only if the 
initial vertices overlap. If two passages share an edge (x, y), they also share initial vertex x. If two passage 
share initial vertex x, then they also share some edges (x,y). 

Due to symmetry, the same holds for the third property. □ 

The following lemma shows that a passage can be viewed as a fixpoint: P = ((tti(P) X V)U(Vx 7T2(P)))nP. 
This property will be used to construct minimal passages. 

Lemma 4 (Another Passage Characterization). Let G — (V,E) be a graph. P C P is a passage if 
and only if P = ((tti(P) xV)U(V x tt 2 (P))) n P. 

PROOF. Suppose P is a passage: it is fully characterized by 7Ti(P) and ir 2 (P). Take all edges leaving 
from tti(P): P = (tti(P) x V) D E. Take all edges entering tt 2 (P): P = (V x tt 2 (P)) n P. Hence, 

p = MP) x v) n p = (v x tt 2 (p)) n p. So, p = ((tti(p) xv)u(v x tt 2 (p))) n p. 

SupposeP= ((7Ti(P)xy)U(V r X7r 2 (P)))nP. Let (x,y) G P and {{x,y'), (x',y)} C P. Clearly, (x,y') € 
(7ri(P)xV)nPand(x',y) G (V r X7r 2 (P))nP. Hence, {(x,j/')» (s',S/)} C ((?ri(P) x V)U(VxTr 2 (P)))nE = P. 

a 



3. Passage Partitioning 

After introducing the notion of passages and their properties, we now show that graph can be partitioned 
using passages. For example, the set of passages {Pi, P 2 , P3, P4, P5} in Figure[2]partitions G\. Other passage 
partitionings for graph G\ are {P 2 ,P5,P7J and {Pi,P 2 ,Pg}. 

Definition 4 (Passage Partitioning). Let G = (V, E) be a graph. V = {Pi, P 2 , . . . , P«} C pas(G) \ {0} 
is a passage partitioning if and only if{jV = E and Vi<i<j<„ PiffPj. 

Any passage partitioning V defines an equivalence relation on the set of edges. For e±, e% G E, e\ ~-p e 2 if 
there exists a P £ V with {ei, e 2 } C P. 

Lemma 5 (Equivalence Relation). Xei G = (V, E) be a graph with passage partitioning V '. ^p defines 
an equivalence relation. 

Proof. We need to prove that ~-p is reflexive, symmetric, and transitive. Let e, e', e" £ E. Clearly, e ~-p e 
because e £ E = {JV (Pisa passage partitioning). Hence, there must be a P £ P with e£V (reflexivity). 
If e ~-p e', then e' ~-p e (symmetry). If e ~-p e' and e' ~-p e", then there must be a P G P such that 
{ei,e 2 ,e3} C P. Hence, e ~-p e" (transitivity). D 

Any graph has a passage partitioning, e.g., {E} is always a valid passage partitioning. However, to decompose 
analysis one is typically interested in partitioning the graph in as many passages as possible. Therefore, 
we introduce the notion of a minimal passage. Passage Pq in Figure [2] is not minimal because it contains 
smaller non-empty passages: P3, P4, and P5. Passage P7 is also not minimal. Only the first five passages in 
Figure [2] (Pi , P 2 , P3, P4 and P5) are minimal. 

Definition 5 (Minimal Passages). Let G — (V,E) be a graph and P £ pas(G) a passage. P is minimal 
if and only if there is no non-empty passage P' £ pas(G) \ {0} such that P' C P. pas min (G) is the set of 
all non-empty minimal passages. 

Two different minimal passages cannot share the same edge. Otherwise, the difference between both passages 
would yield a smaller non-empty minimal passage. Hence, an edge can be used to uniquely identify a minimal 
passage. The fixpoint characterization given in Lemma HI suggests an iterative procedure that starts with a 
single edge. In each iteration edges are added that must be part of the same minimal passage. As shown 
this procedure can be used to determine all minimal passages. 

Lemma 6 (Constructing Minimal Passages). Let G = (V, E) be a graph. For any (x,y) £ E, there 
exists precisely one minimal passage P( x ,y) € P as min(G) such that (x,y) £ P( x , y )- 

Proof. Initially, set P := {(x,y)}. Extend P as follows: P := ((tti(P) x V) U (V x tt 2 (P))) n E. Repeat 
extending P until it does not change anymore. Finally, return Pi x ,y) — P- The procedure ends because the 
number of edges is finite. If P = ((7Ti(P) x V) U (V x 7T 2 (P))) fl E (i.e., P does not change anymore), then 
P is indeed a passage (see LemmaEJ. P is minimal because no unnecessary edges are added: if (x, y) £ P, 
then any edge starting in x or ending in y has to be included. 

To prove the latter one can also consider all passages P = {Pi, P 2 , . . . ,P n } that contain (x,y). The 
intersection of all such passages f] V contains edge (x, y) and is again a passage because of Lemma[2j Hence, 
[\V = P{x,yy □ 

The construction described in the proof can be used compute all minimal passages and is quadratic in the 
number of edges. 

pas min (Gi) = {Pi, P 2 , P3, Pa,P^\ for the graph shown in Figure IT] This is also a passage partitioning. 
(Note that the construction in Lemma KH is similar to the computation of so-called clusters in a Petri net 

ao 

Theorem 1 (Minimal Passage Partitioning). Let G = (V,E) be a graph. pas min (G) is a passage par- 
titioning. 



,P n }C P as(G)\{$}, Ui< 
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PROOF. Let pas min (G) = {Pi, P 2 , ■■■, P„}. Clearly, {Pi,P 2 
Vi<i<j<n Pi#Pj (follows from Lemma pj. 

Figure [3] shows a larger graph G2 = (V2, E 2 ) with Vi = {a, 6, . . . , 0} and £2 = {(a, 6), (6, e), . . . , (n, o)}. The 
figure also shows six passages. These form a passage partitioning. Each edge has a number that refers to 
the corresponding passage, e.g., edge (h, k) is part of passage P4. Passages are shown as rectangles and 
vertices are put on the boundaries of at most two passages. Vertex a in Figure [3] is on the boundary of P\ 
because (a, b) £ P\. Vertex b is on the boundary of Pi and Pi because (a, b) £ P\ and (b, e) £ P 2 - G2 has 
no isolated vertices, so all vertices are on the boundary of at least one passage. 




Figure 3: A passage partitioning for graph G2. 

The passage partitioning shown in Figure [3] is not composed of minimal passages as is indicated by the 
two dashed lines. Both Pi and Pq are not minimal. Pi can be split into minimal passages Pi a = {(a, b)} 
and Pib = {(c, d)}. P 6 can be split into minimal passages P^ a = {(m, /)} and Pqi, — {(n, o), (n, m)}. In fact, 
as shown next, any passage can be decomposed into minimal non-empty passages. 

Theorem 2 (Composing Minimal Passages). Let G — (V, E) be a graph. For any passage P £ pas(G) 
there is a set of minimal non-empty passages {Pi, P2, • • 
Vi<j<y< n Pi#Pj. 



,P n } C pas min (G) such that [j 1<i<n Pi = P and 



Proof. Let {Pi, P2, . . . , P„} = {P( x ,y) I {%, y) G P}- These passages are minimal (Lemma|6| and also cover 
all edges in P. Moreover, two different minimal passages cannot share edges. □ 

A graph without edges has only one passage. Hence, if E = 0, then pas(G) = {0} (just one passage), 
pas min (G) — (no minimal non-empty passages), and is the only passage partitioning. If E ^ 0, then 
there is always a trivial singleton passage partitioning {E} and a minimal passage partitioning pas min (G) 
(but there may be many more). 

Lemma 7 (Number of Passages). Let G = (V,E) be a graph with k — \pas min {G)\ minimal non-empty 
passages. There are 2 k passages and B^ passage partitioningsVA For any passage partitioning {Pi, P2, . . . , P n } 
ofG:n<k< \E\. 



1 B/ e is the fc-th Bell number (the number of partitions of a set of size k), e.g., B3 = 5, -B4 = 15, and B5 = 52 |10| , 
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Proof. Any passage can be composed of minimal non-empty passages. Hence, there are 2 k passages. E>k is 
the number of partitions of a set with k members, thus corresponding to the number of passage partitionings. 
If there are no edges, there are no minimal non-empty passages (k = 0) and there is only one possible 
passage partitioning: 0. Hence, n = 0. If E j£ 0, then pas min (G) is the most refined passage partitioning. 
There are at most \E\ minimal non-empty passages as they cannot share edges. Hence, n < k < \E\. Note 
that n > 1 if E ^ 0. □ 

Graph G 2 in Figure [3] has 2 8 = 256 passages and B s = 4140 passage partitionings. 

4. Passage Graphs 

Passage partitionings can be visualized using passage graphs. To relate passages, we first define the 
input /output vertices of a passage. 

Definition 6 (Input and Output Vertices). Let G — (V,E) be a graph and P E pas(G) a passage. 
in(P) — 7i"i (P) \ 7T2(P) art the input vertices of P, out(P) = ^(P) \ 7i"i(P) are the output vertices of P, 
and io(P) — 7Ti(P) H ^(P) are the input/output vertices of P. 

Note the difference between input, output, and input/output vertices on the one hand and the initial and 
terminal vertices of a passage on the other hand. Given a passage partitioning, there are five types of 
vertices: isolated vertices, input vertices, output vertices, connecting vertices, and local vertices. 

Definition 7 (Five Types of Vertices). Let G = (V,E) be a graph and V — {Pi,P2, . . . ,P n } a passage 
partitioning. Vi SO = V\ (7Ti(P) U ^(P)) are the isolated vertices of V , Vi„ — 7Ti(P) \ ^%{E) are the input 
vertices of V , Vout — ^(P) \ tti(P) are the output vertices of V , V con = Uj^i 7r 2(Pi) H tti(Pj) are the 
connecting vertices ofV, Vi oc = [J i ni(Pi) D 7ra (Pi) are the local vertices ofV. 

Note that V = Vi SO U Vi n U V ou t U V con U Vi oc and the five sets are pairwise disjoint, i.e., they partition V. 
In the passage partitioning shown in Figure [3] a is the only input vertex, k and o are output vertices, and 
e, i and m are local vertices. All other vertices are connecting vertices. 

Definition 8 (Passage Graph). Let G = (V 1 E) be a graph and V = {Pi, P 2 , . . . , P n } a passage partition- 
ing. (V, {(P, P') G V x V | P l> P'}) is corresponding passage graph . 

Figure [4] shows a passage graph. The graph shows the relationships among passages and can be used to 
partition the vertices V into Vi SO U Vi n U V ut U V con U Vioc- 




Figure 4: Passage graph based on the passage partitioning shown in Figure [3] 



5. Quality of a Passage Partitioning 

Passages can be used to decompose analysis problems (e.g., conformance checking and process discovery 
[2]). In the extreme case, there is just one minimal passage covering all edges in the graph. In this case, 
the graph cannot be decomposed. Ideally, we would like to use a passage partitioning V — {Pi, P2, . . . , P„} 
that is accurate and that has only small passages. One could aim at as many passages as possible in order 
to minimize the average size per passage: av(P) = - — - per passage. One can also aim at minimizing the 
size of the biggest passage (i.e., big(P) = max\<i< n \Pi\) because the biggest passage often takes most of 
the computation time. 

To have smaller passages, one may need to abstract from edges that are less important. To reason 
about such "approximate passages" we define the input as G^ = (V, it) with vertices V and weight function 
n G (V x V) — >• [— 1, 1]. Given two vertices x,y € V: ir(x,y) is "weight" of the possible edge connecting 
x and y. If tt(x, y) > 0, then it is more likely than unlikely that there is an edge connecting x and y. If 
ir(x,y) < 0, then it is more unlikely than likely that there is an edge connecting x and y. One can view 
giSlp± as the "probability" that there is such an edge. The penalty for leaving out an edge (x, y) with 
ir(x,y) = 0.99 is much bigger than leaving out an edge (x',y') with ir(x' ,y') = 0.15. The accuracy of a 
passage partitioning V — {Pi,P 2 , . . . ,P n } with E = Ui<j<„ Pi for input G^ = {V,ir) can be defined as 
acc(P) — (g.«)£E > — ^^ If acc(P) = 1, then all edges having a positive weight are included in 

some passage and none of edges having a negative weight arc included. Often there is a trade-off between 
higher accuracy and smaller passages, e.g., discarding a potential edge having a low weight may allow for 
splitting a large passage into two smaller ones. Just like in traditional graph partitioning 7, 8 , one can look 
for the passage partitioning that maximizes acc(P) provided that av(P) < r av and/or big(P) < Tbig, where 
r av and Tbig are suitably chosen thresholds. Whether one needs to resort to approximate passages depends 
on the domain, e.g., when discovering process models from event logs causalities tend to be uncertain and 
including all potential causalities results in Spaghetti-like graphs [T], therefore approximate passages are 
quite useful. 

6. Conclusion 

In this article we introduced the new notion of passages. Passages have been shown to be useful in the 
domain of process mining. Given their properties and possible applications in other domains, we examined 
passages in detail. Passages are closed under the standard set operators (union, difference, and intersection). 
A graph can be partitioned into components based on its minimal passages and any passage is composed of 
minimal passages. The theory of passages can be extended to deal with approximate passages. We plan to 
examine these in the context of process mining, but are also looking for applications of passage partitionings 
in other domains (e.g., distributed enactment and verification). 
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